Recommending Anchor Points in Structure-Preserving Hypertext Document Retrieval
نویسندگان
چکیده
Traditional WWW search engines index and recommend individual Web pages to assist users in locating relevant documents. Users are often overwhelmed by the large answer set recommended by the search engines. The logical starting point of the hyper-document is thus hidden among the large basket of matching pages. Users need to spend a lot of effort browsing through the pages to locate the starting point, a very time consuming process. This paper studies the anchor point indexing problem. The anchor points of a given user query is a small set of key pages from which the larger set of documents that are relevant to the query can be easily reached. The use of anchor points help solve the problems of huge answer set and low precision suffered by most search engines by considering the hyper-link structures of the relevant documents, and by providing a summary view of the result set.
منابع مشابه
Anchor point indexing in Web document retrieval
Traditional World Wide Web search engines, such as AltaVista.com, index and recommend individual Web pages to assist users in locating relevant documents. As the Web grows, however, the number of matching pages increases at a tremendous rate. Users are often overwhelmed by the large answer set recommended by the search engines. Also, if a matching document is a hypertext, the document structure...
متن کاملModelling Anchor Text Retrieval in Book Search based on Back-of-Book Index
This paper proposes a probabilistic logic abstraction for modelling tf -boosting approaches to anchor text retrieval, adapted for the task of page-search in books. The underlying idea is to view the backof-book index (BoBI) as a list of anchors pointing to pages in the book. First, we model the direct application of hypertext-based tf boosting to books and show that this naive method of propaga...
متن کاملDocument Representation and Query Expansion Models for Blog Recommendation
We explore several different document representation models and two query expansion models for the task of recommending blogs to a user in response to a query. Blog relevance ranking differs from traditional document ranking in ad-hoc information retrieval in several ways: (1) the unit of output (the blog) is composed of a collection of documents (the blog posts) rather than a single document, ...
متن کاملDynamic Hypertext Synthesis for Information Retrieval
Hypertext navigation alone is insuficient for eficient Information Retrieval (ZR). Previous attempts to combine IR techniques with hypertext have been confined to the pre-authored structure of a document. In this paper we extend computer-science methods to synthesize a tailor-made hypertext document in response to each user's query. The synthesis technique can also be used to automatically crea...
متن کاملTACHIR: A Tool for Automatic Construction of Hypertexts for Information Retrieval
The paper describes the design and implementation of TACHIR, a prototype tool for the automatic construction of hypertexts for Information Retrieval. TACHIR builds up automatically an IR hypertext, a hypertext to be used for information retrieval, from a document collection, using a methodology that makes use of a set of well known Information Retrieval techniques. The structure of the IR hyper...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998